Compressed Multiple-Row Storage Format

نویسندگان

  • Zbigniew Koza
  • Maciej Matyka
  • Sebastian Szkoda
  • Lukasz Miroslaw
چکیده

A new format for storing sparse matrices is proposed for efficient sparse matrix-vector (SpMV) product calculation on modern throughput-oriented computer architectures. This format extends the standard compressed row storage (CRS) format and is easily convertible to and from it without any memory overhead. Computational performance of an SpMV kernel for the new format is determined for over 140 sparse matrices on two Fermi-class graphics processing units (GPUs) and the efficiency of the kernel, which peaks at 36 and 25 GFLOPS at single and double precision, respectively, is compared with that of five existing generic algorithms and industrial implementations. The efficiency of the new format is also measured as a function of the mean (μ) and of the standard deviation (σ) of the number of matrix nonzero elements per row. The largest speedup is found for matrices with μ & 20 and μ & σ & 1.5 and can be as high as 43%.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Hilbert-order multiplication scheme for unstructured sparse matrices

We investigate a new storage format for unstructured sparse matrices, based on the space filling Hilbert curve. Numerical tests with matrix-vector multiplication show the potential of the fractal storage format (FS) in comparison to the traditional compressed row storage format (CRS). The FS format outperforms the CRS format by up to 50% for matrix-vector multiplications with multiple right han...

متن کامل

Vectorized Sparse Matrix Multiply for Compressed Row Storage Format

The innovation of this work is a simple vectorizable algorithm for performing sparse matrix vector multiply in compressed sparse row (CSR) storage format. Unlike the vectorizable jagged diagonal format (JAD), this algorithm requires no data rearrangement and can be easily adapted to a sophisticated library framework such as PETSc. Numerical experiments on the Cray X1 show an order of magnitude ...

متن کامل

Blocked-based sparse matrix-vector multiplication on distributed memory parallel computers

The present paper discusses the implementations of sparse matrix-vector products, which are crucial for high performance solutions of large-scale linear equations, on a PC-Cluster. Three storage formats for sparse matrices compressed row storage, block compressed row storage and sparse block compressed row storage are evaluated. Although using BCRS format reduces the execution time but the impr...

متن کامل

Data Structures and Algorithms for Distributed Sparse Matrix Operations

We propose extensions of the classical row compressed storage format for sparse matrices. The extensions are designed to accomodate distributed storage of the matrix. We outline an implementation of the matrix-vector product using this distributed storage format, and give algorithms for building and using the communication structure between processors.

متن کامل

Comparative Analysis of Sparse Matrix Algorithms

We evaluate and compare the storage efficiency of different sparse matrix storage formats as index structure for text collection and their corresponding sparse matrixvector multiplication algorithm to perform query processing in information retrieval (IR) application. We show the results of our implementations for several sparse matrix algorithms such as Coordinate Storage (COO), Compressed Spa...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1203.2946  شماره 

صفحات  -

تاریخ انتشار 2012